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Abstract 

In this paper, we outline the prototype of an auto- 
mated inference tool, called QUIP, which provides a 
uniform implementation for several nonmonotonic rea- 
soning formalisms. The theoretical basis of QUIP is de- 
rived from well-known results about the computational 
complexity of nonmonotonic logics and exploits a rep- 
resentation of the different reasoning tasks in terms of 
quantified boolean formulae (QBFs). 



General Information 

Designing theorem provers for nonmonotonic reason- 
ing formalisms is challenged by the increased compu- 
tational complexity compared to the inherent c omplex- 
ity o f classical reasoning. A s is well-known ( Gottlot 
1992|; lEiter fc Gottlob 1993t [Eiter fc Gottlob 1995a| ; 
Eiter fe Gottlob 1995b| ), the main reasoning tasks for 
almost all propositional nonmonotonic logics are either 
£ 2 -complete or n 2 -complete, i.e., they are located at 
the second level of the polynomial hierarchy, whereas 
classical reasoning resides at the first level of that hier- 
archy. 

Important constituents in establishing the 
completeness results for the different nonmonotonic rea- 
soning tasks are so-called quantified boolean formulae 
(QBFs), which generalize ordinary propositional formu- 
lae by the admission of quantifiers ranging over propo- 
sitional variables. To wit, demonstrating E 2 -hardness 
of a given nonmonotonic decision problem, say NMP, 
is usually achieved by reducing the problem QSAT 2 
to NMP, where QSAT 2 is the problem of deciding the 
truth of QBFs being in prenex normal-form whose lead- 
ing quantifiers are ordered like 3P VQ (P, Q are lists 
of 



yP 



: pji'oposiliuual variables), which is complete for E 2 
n general, 



variables,), wl 

(In general, for any class E?, i > 1, there is a eor- 
responding decision problem, QSATj , complete for 
whose task is to check the truth of QBFs of the form 
3Pi VP 2 . . . 3P„-i VP„ $, where P x , . . . , P„ are lists of 
propositional variables and $ is a propositional for- 
mula.) More precisely, E 2 -hardness of nmp is shown 
by constructing a (polynomial) transformation S such 



that for every instance I of QSAT 2 it holds that I is a 
yes-instance of QSAT 2 iff S(I) is a yes-instance of NMP. 

Now, given the membership of a nonmonotonic rea- 
soning task NMP in the class E 2 , and from the E 2 - 
hardness of QSAT 2 , it follows immediately that every 
such decision problem NMP can be reduced to QSAT 2 , 
i.e., there is a (polynomial) transformation T such that 
for every instance I of nmp it holds that I is a yes- 
instance of nmp iff T(I) is a yes-instance of QSAT 2 . 

In t his paper, we des c ribe the prototype system 
QUIP ([Egly et al. 20004 [Egly et al. 2000b), an au- 



tomated reasoning tool utilizing transformations of the 
latter kind to implement several different reasoning for- 
malisms. The basic idea is to employ existing sophis- 
ticated theorem provers for quantified boolean formu- 
lae, taking care of evaluating the resultant instances 
of QSAT 2 . 

At present, QUIP handles the following propositional 
nonmonotonic reasoning approaches: 

• abduction; 

• autoepistemic logic; 

• default logic; 

• disjunctive stable model semantics; 

• circumscription. 

The system has been implemented in C using stan- 
dard tools like LEX and YACC (comprising a total of 
2000 lines of code, excluding the used QBF-solver); it 
runs currently in a Unix environment (Sun/Solaris and 
Linux), but is easily portable to other operating sys- 
tems as well. 

The use of QBFs expr essing advanced reasoning tasks 
has been advocated in ( |Cadoli, Giovanardi, fc Schaeri 



1998[ [Rintancn 1999b) ) and is a natural generalization 
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of a similar method applied for problems in NP. Such 
problems are often solved by a reduction to SAT, the 
satisfiability problem of classic al propositional logic , 
which is NP-complete (see, e.g., ( Kautz fc Selman 199(j ; 
Giunchiglia fc Sebastiani 1996| )). Besides the applica- 
tions discussed in this paper, a redu ction of plannin g 
problems to QBFs has been given in (Rintancn 1999a). 

In general, since the evaluation of arbitrary QBFs is 
PSPACE-complete (in contrast to the E^-completeness 



of the restricted QBFs mentioned above), in principle 
any formalism having a decision problem in P SPACE 
can be handled by QUIP, provided a proper transfor- 
mation has been found. 

The next section outlines the overall architecture of 



QUlK and gives some background information on the 
different reasoning tasks implemented m QUIP. Then, 
the usability of the approach is described. The last 
section contains a discussion on benchmark problems 
and a comparison with other implementations. 

Description of the System 
System Architecture 

The overall architecture of QUIP is depicted in Figure [l]. 
QUIP consists of three parts, namely the filter pro- 
gram, the QBF-evaluator boole, and the output inter- 
preter int. 

The input filter translates the given problem de- 
scription (e.g., a default theory, a disjunctive logic 
program, an abductive problem, etc.) into a quanti- 
fied boolean formula, which is then sent to the QBF- 
evaluator boole. The result of boole, usually a for- 
mula in disjunctive normal form (often called sum of 
products, SOP), is interpreted by int. The latter part 
associates a meaningful interpretation to the formulae 
occurring in SOP and provides an explanation in terms 
of the underlying problem instance (e.g., an extension, 
a stable model, abductive explanations, etc.). The in- 
terpretation relics on a mapping of internal variables 
of the generated QBF into concepts of the problem de- 
scription which is provided by filter. 

The QBF-evaluator boole is a commonly available 
propositional theo rem prover b ased on binary decision 
diagrams (BDDs) ( Bryant 1986| ).P| We chose to use this 
particular tool because of several reasons. For one, 
the program, together with its source code, is in the 
publ ic domain and can be downloaded from the we b 



(see http:/ /www. cs. emu. edu/^modclchcck/bdd. html). 
Second, it represents a sophisticated reasoning engine 
with several years of development. Third, it does not 
require of the input formula to be in a specific nor- 
mal form. The latter point distinguishes boole from 



oth er QBF-solvers, like those proposed in ( Cadoli, Gio- 
vamxdi, & Schacrf 1998; Rintanen 1999b), which op- 



erate on formulae being in prenex conjunctive normal 
form. Although these provers can in principle also be 
employed in QUIP, their inclusion would require an ad- 
ditional normal form translation, since the "natural" 
reductions of nonmonotonic reasoning tasks to QBFs 
do in general not result in formulae being in a particu- 
lar normal form. 

In order to incorporate new formalisms into QUIP, 
one has to extend the filter program responsible for 
the appropriate reductions, the mapping of the vari- 
ables, and the interpreter int. The deductive engine 
remains unchanged in this process. 



Implemented Reasoning Tasks 

In this subsection, we briefly discuss the different rea- 
soning tasks currently implemented in QUIP. We will 
not present the concrete transformations int o QBFs ex- 



pressin g these tasks; details are given in (Egly et al 
2000b| ) 



All formalisms processed by QUIP are propositional. 
In what follows, we assume a propositional language £ 
generated by a finite set of propositional variables V 
using the standard sentential connectives ->, A , V , — 
and <->. A theory is a finite set T C £, which will be 
identified with the formula A^gt c t ) - 

Note that, strictly speaking, the different decision 
problems presented below return either yes or no, but 
usually one is more interested in the corresponding 
function problem returning the actual objects responsi- 
ble for a yes answer (like the extensions of a default the- 
ory, the stable models of a logic program, etc.). QUIP 
permits queries of this form, by engaging the so-called 
detail mode (see discussion below) . 

Abduction. Classical abd uction from a theory T on 
V may be defined as follows (gclman fc Levesque 199C| ; 
Poole 1989| ; |Eitcr fc Gottlob 1995b| ). Let H C V be 

a set of hypotheses, and let p € V be a distinguished 
atom. A subset E C H is an abductive explanation for 
p from T and H , if 

(i) T U E is consistent, and 

(ii) T U E \= p, i.e., TUB logically implies p. 

An explanation E is minimal, if no proper subset E' C 
E is an abductive explanation. 

The following tasks are implemented in QUIP: 

• Given a theory T, a set H of variables, and an atom 
p. Is there a (minimal) abductive explanation E C H 
for p from T and HI 

• The relevance problem: Given a theory T , a set H of 
variables, an atom p, and some hypothesis h e H . Is 
there a (minimal) abductive explanation E C H for 
p from T and H containing h? 

• The necessity problem: Given a theory T, a set H of 
variables, an atom p, and some hypothesis h G H. 
Does h occur in every (minimal) abductive explana- 
tion E C H for p from T and HI 

Autoepistemic logic. The language of Moore's au 



1 For a comparison between different BDD packages, 
cf. ( |Long 199^ ). 



toepistemic logic (Moore 1985) contains the modal op- 
erator L, where L<p intuitively means that <f> is believed. 
By £i we denote the language £ extended by L. For- 
mulae L(p are viewed as propositional variables, which 
are called modal atoms. 

A stable expansion of an autoepistemic theory T C 
£l is a set E C £ L such that 

E = Th(T U {L(j) \<j)£E)\J {^L<j) \ <j) £ E}), 

where Th(-) is the classical consequence operator with 
respect to the extended language £l. 
QUIP handles the following tasks: 



Figure 1: QUIP's system architecture. 



• Given an autoepistemic theory T C £ L , is there a 
stable expansion E of T? 

• Brave reasoning: Given an autoepistemic theory T C 
Cl and some formula $, is there a stable expansion 
E of T containing $? 

• Skeptical reasoning: Given an autoepistemic theory 
T C Cl and some formula $, is $ contained in every 
stable expansion E of Tl 

For brave and skeptical reasoning, if the detail mode is 
engaged, QUIP returns witnesses corresponding to the- 
ses tasks: for brave reasoning, all stable expansions con- 
taining $ are returned, whereas for skeptical reasoning, 
all stable expansions not containing <I> are returned. 

Default logic. A default theory is a pair A = (T, D), 
where T C £ is a set of formulae and D is a set of 
defaults of the form - - 13 Intuitively, the default is 
applied (7 is concluded) if a is provable and the justi- 
fication [3 can be consistently assumed. 

The semanti cs of A = {T , D) is defined in terms 
of extensions (Reitcr 1980). Following (Marek & 



Tru; izczyriski 1993), extensions can be characterized 
thus. For any S C C, let D(S) be the monotonic rules 



{- 



€ D,->/3 £ S}. Then, E C C is an exten- 



sion of A iff E = Th D( - E \T), where Th D ^{T) is the 
set of all formulae derivable from T using classical logic 
together with the rules from D(E). 

QUIP expresses the following reasoning tasks: 

• Given a default theory A, is there an extension E 
of A? 

• Brave reasoning: Given a default theory A and some 
formula <&, is there an extension E of A containing <i>? 

• Skeptical reasoning: Given a default theory A and 
some formula $, is $ contained in every extension E 
of A? 

As for stable expansions, QUIP returns witnesses if 
the detail mode is engaged. Moreover, each of these 
reasoning tasks has been implemented in terms of two 
independent transformations: The first category of re- 
ductions is based on the characterization of extensions 
discussed above; the second category is based on a 
character ization of ext ensions using the notion of a 
full set (Niemela 1995). Interestingly, although the 
transformations based on the latter method arc more 



succinct than the corresponding reductions of the first 
kind, they have often an inferior performance compared 
to the former transformations. 

Disjunctive Logic Programming. A disjunctive 
logic program, II, is a set of rules 

r: H(r) <- P(r),N{r) 

where H(r) is a disjunction of variables, P(r) is a 
conjunction of variables, and N(r) is a conjunction 
of negated variables. A H erbrand interpretation I of 
V is a stable mod el of II ( Gclfond fc Lifschitz 198§| ; 
Przymusinski 1991), if it is a minimal model (with rc- 
spect to set-inclusion) of the program II 1 resulting from 
II as follows: remove each rule r such that I \= a for 
some ->a in N(r), and remove N(r) from all remaining 
clauses. 

Similar to autoepistemic logic and default logic, QUIP 
handles the problem whether a given logic program has 
a stable model, as well as brave and skeptical reasoning. 

Circumscription. In contrast to the formalisms de- 
scribed above, propositional circumscription is already 
a quantified boolean formula, hence it does not require 
a separate reduction. So, QUIP can handle circumscrip- 
tion in a straightforward way. 

In the propositional case, the parallel circumscription 
of a set of atoms P = {pi, . . . ,p n } in a theory T, where 
the atoms Q are fixed and the remaining atoms Z = 
{zi, . . . , z m } = V \ (P U Q) may v ary, is given by the 
following QBF CIRC(T;P,Z), cf. ([Lifschitz 1985h: 



Tf\\/P'VZ'((T[P/P',Z/Z'}A(P' < P)) -> (P < P'] 



For simplicity, we omit multiple justifications here. 



Here, P' — {p[, ...,p' n } and Z' = {z[, , , . ,z' m } are sets 
of new propositional variables corresponding to P and 
Z, respectively, and T[P/P',Z/Z'] results from T by 
substitution of the variables in P'UZ' for those in P\JZ. 
QUIP implements circumscriptive inference of a formula 
4> from T, which is expressed by the QBF 

VV(CIRC(T;P,Z) -> cfy. 

Applying the System 
Methodology 

One of the basic motivations for the development of 
QUIP was to make a rapid prototyping tool avail- 
able, aimed for experimenting with different knowledge- 
representation formalisms. Accordingly, QUIP is de- 
signed in such a way that the important reasoning tasks 



corresponding to the different formalisms under con- 
sideration can be directly encoded as queries. So, the 
methodology of specifying queries suitable to be pro- 
cessed by QUIP is the same as the methodology of 
formalizing a particular problem with one of the for- 
malisms implemented in QUIP (subject to the restric- 
tion, of course, that QUIP currently accepts only queries 
specified over a propositional language; but in a future 
version it is planned to extend the language to allow 
function-free formulae with variables as well.) 

Specifics 

Let us illustrate how the queries of QUIP are structured. 
Generally, QUIP takes an input file as argument and 
writes the output to standard-out: 

quip input_file 

The input file comprises nonmonotonic theories and 
specifications of reasoning tasks. The format of the 
input uses two major concepts: definitions and com- 
mands. In the definitions, one can specify abductive 
theories, autoepistemic theories, default theories, logic 
programs, and propositional theories in general. The 
commands specify which reasoning tasks (with respect 
to the chosen formalism) have to be executed. Theories 
and formulas can be nested by using suitable names, 
so definitions can be edited rather conveniently. Fur- 
ther, commands can refer to specified theories without 
the need to describe them more than once. An input 
file can contain several definitions and commands, even 
referring to different formalisms. 

In the following we describe some of these features 
using a simple example from default logic. Consider 
the following default theory A = (T, D) , representing 
the well-known Nixon-diamond: 

T = {Republican, Quaker}] 

Q ( Republican : -^Pacifist Quaker : Pacifist 1 

\ -^Pacifist Pacifist J ' 

This default theory has two extensions: 

Th({ Republican, Quaker} U {^Pacifist}); 

Th({ Republican, Quaker} U {Pacifist}). 

The extensions of this example can be computed by 
QUIP using the following input file, named nixon: 

@SET DETAIL 

T := Republican k Quaker 
@D D := { Republican : IPacifist I IPacifist, 
Quaker : Pacifist I Pacifist 

} 

@DL ( T ; @D D ) 

Executing the command 

quip nixon 

results in the following output: 

Th( { (Republican) & (Quaker)} u {(Pacifist)} ) 
Th( { (Republican) & (Quaker)} u {(IPacifist)} ) 



The meaning of the different commands and defini- 
tions in the input file nixon can be explained as follows. 
First of all, the command ©SET DETAIL invokes the de- 
tail mode, i.e., all extensions will be displayed. (Recall 
that QUIP admits the processing of both decision prob- 
lems and function problems.) Choosing the command 
SUNSET DETAIL would have resulted in a simple yes/no 
answer. 

The next tokens of the input file specify the con- 
stituents of the default theory A: T represents the back- 
ground knowledge of A, and D contains the defaults. To 
avoid ambiguities, references to defaults always have to 
start with a special string "@D". The logical operators 
are represented in the obvious way ("&" denotes con- 
junction, "!" negation, and "I" separates a default's 
consequent from its justification). 

The command @DL tells QUIP to compute the exten- 
sions of the specified default theory. To perform brave 
or skeptical reasoning, the input file nixon would be 
changed as follows: 

©SET DETAIL 

T := Republican k Quaker 
@D D := { Republican : IPacifist I IPacifist, 
Quaker : Pacifist I Pacifist 

} 

OSET BRAVE 

@DL ( V ; ID D ) |= Pacifist 
©SET SKEPTICAL 

@DL ( V ; ID D ) |= Pacifist 

The SSET-commands switches between the two rea- 
soning modes; " I =" is the symbol standing for the re- 
spective default consequence relations. The first com- 
mand checks whether Pacifist is a brave consequence of 
A, the second commands checks skeptical consequence 
of Pacifist from A. Observe that it is permitted that 
an input file contains a sequence of reasoning tasks. 

The output corresponding to the first command will 

be 

Th( { (Republican) k (Quaker)} u {(Pacifist)} ) 

while the second command displays the extensions from 
which Pacifist docs not follow: 

Th( { (Republican) k (Quaker)} u {(IPacifist)} ) 

Users and Usability 

Obviously, proper usage of QUIP depends on a poten- 
tial user's ability to express a given problem in terms 
of the implemented formalisms. However, a particular 
advantage of QUIP is that it incorporates several differ- 
ent knowledge representation formalisms. Hence, users 
have a choice selecting among different (yet closely 
related) approaches, singling out that particular for- 
malism best suited for a specific purpose, or choosing 
a method based on a user's personal preference (e.g., 
because he/she understands that particular approach 
best). As well, the problem can be represented with dif- 
ferent methods simultaneously, specifying the instances 
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Table 1: Performance Results 



of the resultant formalizations in a single input file, 
which can then be processed by QUIP requiring only 
one initial execution command. 

Evaluating the System 
Benchmarks 

There are different approaches how systems handling 
knowledge representation formalisms can be evaluated. 
One is to perform comparisons taking into account the 
representational power of the implemented formalisms. 
That is to say, under such a comparison, one chooses 
some "natural" problems, encodes it with respect to 
the specific methodologies associated with the imple- 
mented formalisms, and uses the resultant instances as 
queries of the respective systems. So, basically, different 
systems are compared on the basis of (possibly) differ- 
ent representations of the same problem. However, to 
achieve a fair comparison, it is necessary that the ex- 
perimenter is able to encode the given problem "in the 
best possible way" with respect to the particular for- 
malisms. Also, incompatibilities of the underlying for- 
malisms render comparisons between different systems 
often difficult. 

Another possibility is to compare systems on problem 
classes common to each of the considered systems. Such 



a co nparison is appropriate if different implementations 



of the same formalism are evaluated. 

Since QUIP incorporates a wide array of different 
knowledge representation methods, comparisons with 



other implementations can in general be achieved em- 
ploying the latter procedure. 



Accepted benchmarks for nonmonotonic theorem 
provers have be en realized by the well-known Theo- 



ry B ase system ( Cholcwinski et al. 1995 ). This test- 



bed provides encodings of various graph problems in 
terms of default theories or equivalent logic programs. 
However, the generated problems are at most NP-hard 
(or co-NP-hard, depending on the reasoning task), and 
thus do not take full advantage of the expressibility 
supported by the nonmonotonic formalisms. The only 
practically applied benchmark test utilizing a £ 2 -hard 
problem is the strategic co mpanies example c arried out 
for testing the system dlv (Eiter et al. 1998). 

Here, we propose a straightforward method how E 2 - 
hard benchmark problems for prepositional nonmono- 
tonic reasoning formalisms can be generated. The idea 
is to use the class of problems establishing the £ 2 - 
hardness of a formalism. Recall from our previous dis- 



cussion that X 2 -hardness of a nonmonotonic formalism 
is usually demonstrated by constructing a (polynomial) 
transformation S mapping instances / of QSAT 2 (the 
evaluation problem of QBFs having quantifier order 3 V) 
into instances S{T) of the considered nonmonotonic rea- 
soning task, NMP, such that / is a yes-instance of QSAT 2 
iff S(I) is a yes-instance of NMP. Thus, in some sense, 
the class of problems S(I) represents "worst-case" ex- 
amples for the problem nmp, and therefore is partic- 
ularly useful estimating the performance of a theorem 
prover solving the task nmp. Moreover, these problems 
are easily scalable by parameterizing different instances 
of QSAT 2 . An added feature of QUIP is that these ex- 
amples provide at the same time a simple method for 
testing whether the implementation works correct, be- 
cause QUIP turns instances S(I) of NMP back to in- 
stances T(S(I)) of QSAT 2 , satisfying the condition that 
/ is a yes-instance of QSAT 2 iff T(S(I)) is a yes- instance 
of QSAT 2 . The next subsection describes comparisons 
between QUIP and some state-of-the-art provers on the 
basis of these benchmark problems. 

Comparison 

We co mpare the default- logic module of QUIP with 
DeReS (Cholcwinski, Marek, fe Truszcynski 19961) and 



the logic-pro gramming module of QUIP with dlv ( [Eiter 
1998), using the class of examples discussed 



et 



above. Space limits preclude a discussion on the struc- 
ture of these problems; details can be found in the rel- 
evant literature (e.g., ( Gottlob 1992 ;_ Eiter & Gottlob 



19931 ; [Eiter fc Gottlob 1995aj ; |Eiter fc Gottlob 1995b|)) 
We do not include a comparison with smodels ( Niemela 



& Simons 199E) here, because that system is currently 
not designed to handle E 2 -problems (a comparis on be- 



tween QUIP, DeReS, dlv, smodels, and Theorist (Poole 
using examples from T heoryBase and som e ab- 



duction problems, is given in ( Egly et al. 2000b )). 

Results of the comparison are given Table [IJ All tests 
have been performed on a SUN ULTRA 60 with 256MB 
RAM; the run-time is measured in seconds with an up- 
per limit of 90sec (i.e., instances requiring a longer pe- 
riod are not displayed). The first group of entries gives 
the results for the disjunctive logic programming test; 
the second group gives the results for the default logic 
test; and the final row contains some measurements us- 
ing instances of the corresponding abductive problem 
class. The respective input-QBFs have been randomly 
generated and are parameterized by the number k of ex- 
istential quantifiers (the number of variables was held 



fixed and was set to 20). Although the given results 
represent only a small sample, they do indicate that 
our ad hoc implementation performs sufficiently well. 

Problem Size 

ft is rather obvious that QUIP cannot compete with 
state-of-the-art implementations like dlv or smodels in 
terms of problem size. These tools are highly optimized 
systems developed with a particular semantics in mind, 
whereas the purpose of QUIP is to provide a uniform 
method dealing with several knowledge representation 
tasks at the same time. Under this perspective, and 
taking into account that QUIP utilizes at present no 
optimizations whatsoever, our results demonstrate that 
implementing nonmonotonic reasoning formalisms us- 
ing reductions to quantified boolean formulae is a fea- 
sible approach. Moreover, the modular architecture of 
QUIP allows an easy scalability and parallelization, by 
using, e.g., several QBF-provers simultaneously, each of 
which with its own optimization method. 
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